Best Video Generation Large Model AI Tools & Models - Premium Video Generation Large Model News

AI News

LobsterAI Launches Image and Video Large Model Matrix, Integrating Four Mainstream Image and Video Generation Models at Once

Domestic AIGC multimodal creation field has made new progress, with the open-source AI product LobsterAI (Lobster) under NetEase Youdao upgraded and officially launched image and video generation capabilities. This upgrade adopts a matrix integration strategy, integrating four mainstream multimodal large models: Seedream, Seedance, HappyHorse, and MiniMax-Hailuo, enhancing creative efficiency and diversity.

15.4k 6 hours ago

No Rehearsal, Real Fight on Stage! Meituan LongCat-Video-Avatar1.5 Open-Sourced: Fully Outperforming Mainstream Closed-Source Models

The Meituan Dragon Cat large model team has open-sourced the commercial-grade digital human video generation model LongCat-Video-Avatar1.5, achieving a leap from open-source SOTA to commercial application. This version significantly improves in core dimensions such as lip synchronization, physical plausibility, long video stability, multi-person interaction, and efficient inference, aiming to solve the pain points of traditional digital human videos and promote the application of digital humans toward realistic scenarios tailored for individuals.

16k 3 days ago

No Rehearsal, Real Fight on Stage! Meituan LongCat-Video-Avatar1.5 Open-Sourced: Fully Outperforming Mainstream Closed-Source Models

Aliyun BaiLian Launches Major Upgrade: Full-Stack Open Access, Building an AI Model Supermarket

On May 20th, during the summit, Alibaba Cloud announced that its large model service platform "BaiLian" has strengthened its open ecosystem, integrating top third-party models from multiple companies, covering fields such as text, image, video, and multimodal generation. This move marks BaiLian's transformation from a showcase for Alibaba's self-developed Qianwen model into an AI model supermarket that includes mainstream models across industries. The first batch of integrated model portfolios is rich and diverse.

18.5k 1 days ago

Kuaishou Technology Board Evaluates Restructuring of Ke Ling AI Business, Possible Introduction of External Financing

Kuaishou Technology announced that its board is evaluating a restructuring plan for Kuaishou Ling (Kling) AI assets, potentially involving external financing. Kling is Kuaishou's self-developed video generation large model, expected to launch in June 2024. On January 31 this year, the Kling 3.0 series was released, including image, video, and Omni versions, with technical upgrades offering richer content.....

14.4k 2 days ago

AI Products

Vidu Q1

Vidu Q1, a domestically produced video generation large language model, supports high-definition 1080p video generation and offers exceptional value for money.

Video generation

9.7k

Wan2.1

Wan2.1 is an open-source, advanced, large-scale video generation model supporting various video generation tasks.

Video generation

12.8k

Luma Ray2

A large-scale video generation model capable of creating realistic visual effects and naturally coherent motions.

Video generation

12.6k

HunyuanVideo

A large-scale video generation model training framework open-sourced by Tencent.

Video generation

11.3k

Models

Gemini 2.0 Flash-Lite

Google

$0.49

Input tokens/M

$2.1

Output tokens/M

Context Length

GPT-4.1 mini

Openai

$2.8

Input tokens/M

$11.2

Output tokens/M

Context Length

Grok 4 Fast

Xai

$1.4

Input tokens/M

$3.5

Output tokens/M

Context Length

o3-mini

Openai

$7.7

Input tokens/M

$30.8

Output tokens/M

200

Context Length

GPT-5 Codex

Openai

Input tokens/M

Output tokens/M

Context Length

Claude 3 Opus

Anthropic

$105

Input tokens/M

$525

Output tokens/M

200

Context Length

Gemini 2.0 Flash

Google

$0.7

Input tokens/M

$2.8

Output tokens/M

Context Length

Claude Haiku 4.5

Anthropic

Input tokens/M

$35

Output tokens/M

200

Context Length

Gemini 2.5 Flash

Google

$2.1

Input tokens/M

$17.5

Output tokens/M

Context Length

Claude Sonnet 4.5

Anthropic

$21

Input tokens/M

$105

Output tokens/M

200

Context Length

Claude 3 Sonnet

Anthropic

$21

Input tokens/M

$105

Output tokens/M

200

Context Length

Gemini 2.5 Flash-Lite

Google

$0.7

Input tokens/M

$2.8

Output tokens/M

Context Length

qwen3-vl-235b-a22b-thinking

Alibaba

Input tokens/M

$20

Output tokens/M

Context Length

qwen3-coder-plus

Alibaba

Input tokens/M

$16

Output tokens/M

Context Length

wan2.5-i2i-preview

Alibaba

Input tokens/M

Output tokens/M

Context Length

Qianfan-Lightning

Baidu

Input tokens/M

Output tokens/M

128

Context Length

qwen3-max

Alibaba

Input tokens/M

$24

Output tokens/M

256

Context Length

qwen3-vl-plus

Alibaba

Input tokens/M

$10

Output tokens/M

256

Context Length

qwen-image-plus

Alibaba

Input tokens/M

Output tokens/M

Context Length

qwen-image-edit

Alibaba

Input tokens/M

Output tokens/M

Context Length

Empowering the future, your artificial intelligence solution think tank

English 简体中文繁體中文にほんご

FirendLinks:

AI Newsletters AI Tools MCP Servers AI News AIBase LLM Leaderboard AI Ranking

Business Cooperation Site Map

AI News

LobsterAI Launches Image and Video Large Model Matrix, Integrating Four Mainstream Image and Video Generation Models at Once

No Rehearsal, Real Fight on Stage! Meituan LongCat-Video-Avatar1.5 Open-Sourced: Fully Outperforming Mainstream Closed-Source Models

Aliyun BaiLian Launches Major Upgrade: Full-Stack Open Access, Building an AI Model Supermarket

Kuaishou Technology Board Evaluates Restructuring of Ke Ling AI Business, Possible Introduction of External Financing

AI Products

Vidu Q1

Wan2.1

Luma Ray2

HunyuanVideo

Models

Gemini 2.0 Flash-Lite

GPT-4.1 mini

Grok 4 Fast

o3-mini

GPT-5 Codex

Claude 3 Opus

Gemini 2.0 Flash

Claude Haiku 4.5

Gemini 2.5 Flash

Claude Sonnet 4.5

Claude 3 Sonnet

Gemini 2.5 Flash-Lite

qwen3-vl-235b-a22b-thinking

qwen3-coder-plus

wan2.5-i2i-preview

Qianfan-Lightning

qwen3-max

qwen3-vl-plus

qwen-image-plus

qwen-image-edit

Sam3

Qwen3 VL 30B A3B Thinking GGUF

Qwen3 VL 8B Thinking AWQ 4bit

SFWan2.2 T2V A14B Diffusers

Qwen3 VL 235B A22B Instruct

Wan2.1 T2V 14B

VideoLLaMA2.1 7B AV CoT

Text2Motion

Wan2.1 T2V 14B

HunyuanVideoGP HFIE

HunyuanVideo

AuroraCap 7B VID Xtuner

CogVideoX 5b

Videollm Online 8b V1plus

Text To Video Lvd Ms